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Abstract —In this paper, we propose a novel single-group 
multicasting relay beamforming scheme. We assume a source that 
transmits common messages via multiple ampllfy-and-forward 
relays to multiple destinations. To increase the number of degrees 
of freedom in the beamforming design, the relays process two 
received signals jointly and transmit the Alamouti space-time 
block code over two different beams. Furthermore, in contrast 
to the existing relay multicasting scheme of the literature, we take 
into account the direct links from the source to the destinations. 
We aim to maximize the lowest received quality-of-service by 
choosing the proper relay weights and the ideal distribution of 
the power resources in the network. To solve the corresponding 
optimization problem, we propose an iterative algorithm which 
solves sequences of convex approximations of the original non- 
convex optimization problem. Simulation results demonstrate 
significant performance improvements of the proposed methods 
as compared with the existing relay multicasting scheme of the 
literature and an algorithm based on the popular semidelinite 
relaxation technique. 

Index Terms —Amplify-and-forward, concave-convex program¬ 
ming, multicasting, rank-two beamforming, semidelinite relax¬ 
ation. 

I. INTRODUCTION 

“Beamforming is a versatile and powerful approach to 
receive, transmit, or relay signals-of-interest in a spatially 
selective way in the presence of interference and noise” HI. 
Recently, the concept of receive and transmit beamforming has 
been used to enhance coverage and data rate performance in 
amplify-and-forward (AF) relay networks. Distributed beam¬ 
forming has been applied as a coherent transceiver technique 
in a variety of relay network architectures, such as single-user 
networks Ei-ia. In such networks one source transmits data 
to one destination. Single-user networks have been extended 
to peer-to-peer networks, where multiple source-destination 
pairs communicate directly via relays ||7), H], IfT^ . More¬ 
over, distributed systems of non-connected relays can be used 
for multicasting to transmit data from one source to many 
destinations simultaneously to avoid exhaustive individual 
transmissions a-Ha. 

The task to select the optimum antenna weights in central¬ 
ized beamforming systems using a connected antenna array 
Il3]-l2l, or in distributed beamforming systems using a non- 
connected array a-ini for multicasting is highly non-trivial 
as it requires to form beams towards several destinations, 
each corresponding to a different spatial signature. In Ha, 
it has been shown that the problem of selecting the antenna 
weights for single-group multicasting is NP-hard. Until now, 
no polynomial time algorithm for NP-hard problems is known 
and it is expected that exact solutions can only be computed 
within exponential time. 


To derive adequate solutions to beamforming problems 
which belong to the class of non-convex quadratically con¬ 
strained quadratic optimization problems (QCQPs), computa¬ 
tionally efficient algorithms have been proposed which approx¬ 
imate the feasible set of the optimization problem ISl- lfTOl . 

lEi-iiiii, Ea-iEa. 

Outer approximation techniques replace the QCQPs by 
convex semidelinite programs (SDPs) which can be solved 
efficiently M-M- In the latter approach, the weight vector 
is replaced by a positive semidelinite Hermitian matrix. Since 
this so-called semidelinite relaxation (SDR) technique extends 
the feasible region, a solution matrix to the SDR problem lies 
not necessarily in the feasible set of the original problem. If 
the rank R(X*) of the solution matrix X* equals to one, the 
SDR solution is a global solution to the original problem. 
In practice, however, 7?.(X*) might be greater than one and 
X* is not feasible for the original problem. Especially in 
single-group multicasting scenarios, where many destinations 
demand a minimum received quality-of-service (QoS), it is not 
likely that R(X*) = 1. This is a consequence of the fact that 
the existence of an SDR solution matrix with rank TZ(X*) is 
only guaranteed if R(X*) > 0{\/M), where M is the number 
of destinations llTSll . If 7?.(X*) > 1, the objective value of the 
SDR solution is only a lower bound to the objective value 
of the QCQP The accuracy of the lower bound decreases 
with a growing number of destinations M ifTsll . Therefore, 
for multicasting with large M, the lower bound generated by 
SDR can lie quite far from the true minimum value for rank- 
one beamforming. 

Interestingly, the formulations of the optimization problems 
for single-group multicasting are similar to the multi-user 
downlink beamforming problems of ifS^ - llbTI . The latter 
works consider scenarios where each destination receives in¬ 
dividual signals and exploit uplink-downlink duality to derive 
algorithms that solve joint beamforming and power allocation 
problems. However, for single-group multicasting problems, 
these algorithms are not applicable due to the duality gap ifTsll . 

Recently, in the two independent works iia and na, 
rank-two transmit beamforming techniques for multicasting 
networks have been proposed in which also rank-two SDR 
solution matrices are feasible. In these techniques, two weight 
vectors are used at the transmitter to process two data symbols 
jointly. Rank-two beamforming techniques have gained much 
interest in the current research as the system performance is 
enhanced due to the increased number of degrees of freedom 
resulting from the additional weight vector HO), Ha-ell. 
The gain in performance comes at virtually no additional cost 
of decoding at the receiver and symbol-by-symbol detection 
can be applied by utilizing Alamouti’s orthogonal space-time 
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block coding (OSTBC) |[36l. Note that Alamouti’s code has 
been further developed in Ell-iol. Moreover, it has been 
combined with a variety of signal processing techniques for 
multi-antenna systems, as for instance, multi-user detection 
M, na, transmit beamforming with limited channel feed¬ 
back ia, na, receive beamforming Il45l . and interference 
alignment ll46ll . BtII . 

In this paper, we propose a distributed rank-two beam¬ 
forming scheme for single-group multicasting using a network 
of amplify-and-forward (AF) relays. In AF multicasting net¬ 
works, the relays forward common messages from a single 
source to multiple destinations. The proposed AF single-group 
multicasting scheme (AFMS) is a non-trivial extension of 
the transmit beamforming technique of ifTsl and CH to a 
distributed beamforming system. In such a system, the retrans¬ 
mission of the noise at the relays generally leads to correlated 
noise at the destinations, even when OSTBCs are used ll49l . 
Then, symbol-by-symbol detection is not optimal. Here, we 
design a distributed beamforming scheme which achieves 
uncorrelated destination noise. We refer to our scheme as 
the Rank-2-AFMS in distinction to the conventional Rank- 
1-AFMS of m. As another generalization to the Rank-1- 
AFMS, we exploit direct link connections from the source 
to the destinations. 

As the design criterion to select the power at the source and 
the relay weights, we aim to maximize the minimum QoS at 
the destinations under constraints on the transmit power in the 
network. We consider constraints on the maximum transmit 
power of the source, on the individual power of every relay, 
on the sum power of the relays, and on the total power by 
both the relays and source. 

To solve the non-convex max-min fairness optimization 
problem of jointly determining the relay weight vector and the 
power split between the relays and the source, we propose a 
linearization-based iterative algorithm. This algorithm belongs 
to the class of concave-convex procedure (CCCP) algorithms 
ll54l . In advantage to the SDR technique, where the number 
of variables is roughly squared, the number of variables is 
not increased for CCCP algorithms. CCCP algorithms are 
used to approximately solve non-convex difference of convex 
(DC) programming problems. Many optimization problems 
that arise in the context of wireless communications are DC 
problems, including power allocation Il23l - ll28l and beam¬ 
forming problems Il27l - ll^ . In contrast to the algorithms for 
max-min fair beamforming of HI, ini, m, ii, and llT4l , 
which treat solely the optimization of beamforming vectors, 
our algorithm derives the relay weight vectors and allocates 
transmit power to the source and the relays. The max-min 
fair beamforming approaches in na and IH combine the 
SDR technique with one-dimensional (ID) search on the 
maximum QoS. To compare our CCCP algorithm (Max-Min- 
CCCP) with the latter SDR-based algorithms, we combine the 
SDR technique with two-dimensional (2D) search on both the 
maximum QoS and the best power split between the relays 
and the source. 

To test the Rank-2-AFMS and the Rank-1-AFMS under 
realistic conditions, we use the channel model of 1481 . The 
simulation results demonstrate the performance of the pro¬ 


posed rank-two scheme combined with the proposed algorithm 
compared with the rank-one scheme of 13 and the theoretical 
bound obtained by SDR. The Max-Min-CCCP algorithm out¬ 
performs the SDR technique for high destination numbers at a 
much lower runtime. Moreover, the Max-Min-CCCP algorithm 
offers a good performance-runtime trade-off and achieves a 
minimum signaTto-noise ratio (SNR) which is less than 1 dB 
lower than the theoretical bound after three iterations. 

The contribution of this paper can be summarized as fol¬ 
lows: 

• The conventional Rank-1-AFMS of 13 is generalized 
to the Rank-2-AFMS, where the direct link connections 
from the source to the destinations are exploited. 

■ In extension of the previous works on rank-one beam¬ 
forming 13, HU, m, m, es-es, and on rank-two 
beamforming m, iia-113 for multicasting scenarios, 
we combine the beamformer design with power alloca¬ 
tion. 

• The rank-two beamforming technique of M and ED is 
generalized to a distributed beamforming system with a 
fundamental difference in the system model. 

• The proposed Rank-2-AFMS enjoys symbol-by-symbol 
maximum-likelihood (ML) detection due to the uncorre¬ 
lated destination noise. 

■ A rank-two optimization framework is established to 
develop the CCCP algorithm. Compared with the tradi¬ 
tional SDR approach the latter approach has the following 
advantages: 

- No additional searches for the optimum source power 
and the highest minimum SNR are required as the 
CCCP algorithm computes all parameters jointly. 

- The CCCP algorithm converges to a stationary point. 

• The simulation results demonstrate that the performance 
of the proposed system combined with the proposed 
CCCP algorithm is close to the theoretical performance 
bound. 

Notation: E{-}, | • |, tr(-), (•)*, (-j^, 5P{-}, and {■)^ denote 
the statistical expectation, absolute value of a complex number, 
trace of a matrix, complex conjugate, transpose, real part 
operator, and Hermitian transpose, respectively. Y A 0 means 
that Y is a positive semidefinite matrix, diag(a) denotes 
a diagonal matrix, with the entries of the vector a on its 
diagonal, blkdiag([Yi, Y 2 ,..., Ym]) is the block diagonal 
matrix formed from the matrices Yi, Y 2 ,..., Ym- Oat is the 
A^ X 1 vector containing zeros in all entries. On = diag(Ojv) 
and In is the NxN identity matrix, x ~ A/^(a, Y) means that 
X is circularly symmetric complex Gaussian distributed with 
mean a and covariance matrix Y. For the complex number 
2 ; = \z\e^'^, aig{z) denotes the phase (p. 7l(Y) denotes the 
rank of Y. 

II. SYSTEM MODEL 

Consider a wireless network of i? relays forwarding the 
signals from a single source to M destinations. In our single¬ 
group multicasting scenario, all destinations demand the same 
information. The source, the relays, and the destinations are 
single antenna devices, see Eig. [T] 
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Fig. I. Multicasting via the relay network; ig): source, o: relay, •: destination. 
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Fig. 2. Proposed Rank-2-AFMS; (g: source, o: relays, •: destinations. 


In the proposed Rank-2-AFMS, two data symbols are jointly 
processed in a four time slot scheme, see Fig. In the first and 
second time slot of the Rank-2-AFMS, the source transmits the 
data symbols si and S 2 , respectively, which are drawn from 
a discrete symbol constellation S. Both symbols are weighted 
by the same real-valued power scaling factor ai. The i? x 1 
vectors xi = ... ,xr^i^ and X 2 = ■ ■ • ,xr^ 2 ]^ of 

the received signals at the relays in the first and second time 
slot are respectively given by 


xi = faisi + X2 = fais^ + 773, ( 1 ) 

where = [ 7714 ,... and = [ 771 , 2 , ■ ■ • are 

the Rxl vectors of the relay noise of the first and second time 
slot, respectively, and where f = [/i,..., is the Rxl 
vector, containing the complex coefficients of the channels 
from the source to the relays. Note that all channels in the 
network are assumed to be frequency flat and constant over 
the four considered time slots. 

In contrast to the system model of ||9l, we consider that 
there exist direct channels from the source to the destinations. 
The signals ym,i and ym ,2 received by the mth destination in 
the first and second time slot, respectively, are given by 

7 /m,l — djfiCXiSi -f 77 ^, 1 , ym,2 — dmOilS2 “F ^m^2: (2) 

where dm is the coefficient of the channel from the source 
to the 777th destination and iXm.i and iXm,2 represent the 
destination noise of the first and second time slot, respectively. 

We make the practical assumptions that the noise processes 
in the network are spatially and temporally independent and 


complex Gaussian distributed. The noise power at the desti¬ 
nations is given by 

= ^2, 9 e {1,2, 3,4}, (3) 

and the noise at the relays is distributed according to 

^ J\f{OR,a'^lR), r]2 ^ Af{OR,a‘^lR), (4) 

where is the power of the noise at the relays. 

Here, the relays transmit their signals over two different 
beams. In this fashion, the proposed Rank-2-AFMS enables 
the relays to create two different communication links from 
the source to each destination. In the conventional Rank-1- 
AFMS of a, the relays transmit their signals over one beam, 
creating a single communication link from the source to each 
destination. 

The i? X 1 vectors ta = [fi, 3 ,..., and t 4 = 

[fi, 4 ,..., ffi, 4 ]^ of the signals transmitted by the relays in the 
third and fourth time slot, respectively, can be expressed as 

ta = Wixi+W2x;, t4 = -W2x{4-WiX2, (5) 

where Wi = diag(w{^), W 2 = diag(w|^), and Wi = 
[wi^i,.--,WR^i]’^ and W 2 = [7i’i,2, ■ • ■, ^^, 2 ]^ are the com¬ 
plex Rx 1 relay weight vectors. According to equations (|5]l, the 
relays transmit their received signal vectors Xi and X 2 over 
the two weight vectors Wi and W 2 . The use of two weight 
vectors increases the degrees of freedom in the distributed 
beamformer design. In general, the superposition of multiple 
symbols that are simultaneously transmitted over different 
beams leads to difficulties in the decoding at the destinations. 
In our proposed Rank-2-AFMS however, symbol-by-symbol 
detection at the destinations is facilitated by the particular 
spatial encoding scheme applied in equations dUl. The lat¬ 
ter encoding scheme corresponds to the popular Alamouti’s 
OSTBC scheme, in which two data symbols are transmitted 
over two spatial channels over consecutive time slots ll^ . 
In contrast to conventional OSTBC schemes, which do not 
require the availability of channel state information (CSI) 
at the transmitter, the Rank-2-AFMS creates two “artificial” 
spatial channels which are shaped by choosing two beams 
with corresponding beamforming vectors that are designed 
based on CSI. It is assumed that the perfect CSI is available 
at a central processing node that computes the source power 
and the relay weights and communicates each parameter to its 
corresponding node. 

Note that according to equations (|5]l, the signals transmitted 
by the relays consist of linear combinations of the received 
signals and their conjugates. This is also the case in distributed 
OSTBC schemes which, however, do not use CSI at the 
transmitter ll4^ . ll50l . 

To further exploit the direct links from the source to the 
destinations, the source transmits the signals aaSi +a 4 S 2 and 
—Q! 4 s{ + aas| in the third and fourth time slot, respectively, 
where and Q !4 are complex-valued scaling factors. 

The received signals of the third and fourth time slot at the 
7T7th destination can then be written as 

2/m ,3 — Sm^3 “F dmO^S^l “F dm^4:^2 “F 77^,a, 
ym^4 — §m^4 dmO^A^i “F dm^3^2 (6) 
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where 

Sm ~ [9m,l: • • ■ ; (7) 

is the R X 1 vector of the complex frequency flat channels 
between the relays and the mth destination and and 
denote the receiver noise at the mth destination in the third 
and fourth time slot, respectively. 

Introducing as the vector of the received signals at the 
TOth destination, n,„ as the vector of the noise at the mth 
destination, and Hm as the equivalent channel matrix and 
making use of equations ([T]i, Q, and (|7]i, the received signals 
of the four time slots in equations (|2]i and (|6]l can be compactly 
written as 


Ym — + rij, 


( 8 ) 


where 


r A r * * 

Ym — [yrn,l-,yrn,2^yrn,3->yrri,4\ f 


= [S1,S2]' 


A 

H-TT) - 




Ha_^, = 




^iG-mVl + wf Gm»72 
.-wi’G^J7i+wfG^r75 
0 

0 (aidm)* 

H-A.m 

^m,2 

— h* h* 

'‘m,2 

— CtiW^ Gjyjf “f 
dm,2 — CtiW2 Gjyjf “b OC4.dm-. 

Gm = diag(gm). 


^m,3 

^m.4. 


(9) 

( 10 ) 

( 11 ) 

( 12 ) 

(13) 

(14) 

(15) 

(16) 


Note that hm,i and hm ,2 denote the equivalent scalar channel 
coefficients that model the flat fading channels between the 
source and the mth destination that are obtained from relay 
beamforming using weight vectors Wi and W 2 , respectively. 
According to equation (fTSl l these channels form the matrix 
HA.m, which has the same orthogonal structure as the channel 
matrix of Alamouti’s OSTBC scheme. Interestingly, if no 
direct source-destination channels exist, i.e., dm = 0 for 
all m = 1 ,..., M and the destinations receive signals only 
in the third and the fourth time slot, the developed system 
model of equation ([8]l corresponds to the system model of 
US and IfT^ for centralized rank-two beamforming. However, 
for the latter schemes, the noise at the destinations can be 
modeled as temporally white. In contrast to the proposed relay 
beamforming approach we observe from equation (fTTI) that 
the third and the fourth entry of the noise vector n,„ both 
depend on the relay noise and 772 in the first and the second 
time slot, respectively. However, due to the specific orthogonal 
transmission format corresponding to the proposed AFMS it 
can readily be verified that the covariance matrix of the noise 
vector in equation (fTTl i exhibits a diagonal structure of the 
form 

E{n„n^} = blkdiag ([cr2l2,(T^_34l2]) , (17) 


where 

f^m,34 - 0mWi -f wf 5^W2) -f (T^, (18) 

Gm = G™G^ = diag([|gm,ip,...,|g,„,fip]). (19) 

The spatially and temporally uncorrelated noise property ex¬ 
pressed in equation (fTTI) is essential for the use of simple 
symbol-by-symbol detection at the destination, which achieves 
Maximum-Likelihood (ML) performance as will be shown 
in the following. To account for the difference in the noise 
powers of the different time slots due to noise amplification 
at the relays, we transform both sides of equation dSll by 
multiplication with the diagonal scaling matrix 

r 4 blkdiag ([(a^. 34 /a.)l 2 , 12 ]), (20) 

resulting in the equivalent model representation 

Tym = THmS -b rn„j, (21) 


corresponding to a uniform noise power model, i.e., rn^ 
is given by E |(rn„) (rn„)^| = cr^_ 34 l 2 . Then, using 
equation (l2Tl i. the ML detection problem of finding s can be 
equivalently reformulated as the least squares problem 

min llTy^-TH^sf. (22) 

sG»Sx5 

From equations (fT^ and (l20l i. we observe that H^T^THm = 
c^l 2 , where 


c= \J[al\dm?(Tl,^^i)/al + \hm,l? + \hm, 2 ?. 

Let US define the matrix 11 = (l/c)[rHm, Z], where the 
matrix Z is chosen such that n^II = I 2 , i.e., equation 
can be equivalently written as 

2 

s 


mm 

s€»Sx5 


U^Tym - c 


= min c^\\sm - s|p 
seSxS 


I 2 

O 2 J 
constant 


where 


and 


Sm = Bym = s -b Bn„, 


(23) 

(24) 


B 4 (l/c)2(rH^)"r. (25) 

From equation (|2^ . the detection of si and S 2 decouples into 
two scalar detection problems since 
.2 


■AT 


S,cr, 


^h), 


^2 A 
— 


2 2 
^m,34^v 


(26) 

(27) 


which follows directly from equations (fTTI) and (|25]) . In other 
words, in the proposed four phase scheme ML detection 
reduces to simple symbol-by-symbol detection. We remark 
that the diagonal structure of the error covariance matrix 
follows from the orthogonal encoding in equations (|5]l. 

From equation (|26] |. we see that the SNR in is given by 


SNR„ = 4 = 5% 

crt erf. 


\hm,l\^ 


IA 


m,2 \ 


2 

^m .34 


(28) 
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for both data symbols. For the sake of convenience, let us 
introduce the following vector notation 


wi = [wi, W2 = [w^, {uiaiYf', 

R„i = blkdiag([Rm,Rm]), 

Rm = blkdiag([CT^C/m,0]). 

Using equations (fTSl l and ( |29] | - ( |32] | yields 

O'm.34 = wf R™Wi + wf R„iW 2 + 

= w^R„iW + al- 


(29) 

(30) 

(31) 

(32) 


(33) 

(34) 


Let us furthermore introduce 


Qm = blkdiag([Qm,i, Qm, 2 ]), (35) 

Qm,l ^ ^rn,l^rn,li Qm,2 — Qm,2Qm^25 (36) 

Qm.l = [(Gmf)^,dm]^, (37) 

a = l/|aip, (38) 

where a is a power scaling factor. With equations (fT4l i. ( fTST i, 
and - d^ . we have 

\hm,lf + \hm, 2 f = (wfQm^iWi + wfQ„, 2 W 2 ) /a 
= w-^Qmw/a. 


Using the above identity together with equations d33] l and 
d34l i. we reformulate the SNR given in equation d28l l in vector 
notation as 


SNR 

m (wi, W2,a) 

_ wf Q^^Wi + wf Q^, 2 W 2 

(wf R™wi + wf R,„W 2 + al)a 


or, equivalently, as 

SNRm(w,a) = 


w-^QmW 


1^77 


1^77 


(w-f^RTTjW + al)a 


(39) 


(40) 


III. Beamformer Design and Power Control 

In this section, we derive an algorithm to design the weight 
vectors and to distribute the power between different time 
slots and between the source and the relays. We consider 
the problem of maximizing the minimum QoS measured 
in terms of the SNR at the destinations subject to power 
constraints. The power constraints include thresholds on the 
individual power of each relay and the source, the sum power 
of the relays, and the total power of the network. Maximizing 
the minimum SNR is a practical objective, e.g., for packet- 
data traffic with full buffer networks the receivers demand 
the largest feasible data rates rather than a certain constant 
data rate, provided that the average rate is satisfactory ED- 
Hence, max-min fairness is a common design criterion that 
has been used in m, in, im, El, 1221, and El. Here, 
the corresponding max-min fairness optimization problem is 
formulated as 


V: 


max min SNR 771 

(w.a) 

s.t. (w,a) G ft, 


where w and a belong to the set 

ft = {w,a I w,a satisfy (HUi } 
characterized by the following power constraints 

positivity; a > 0, (41a) 

individual relay power; Pr(w, a) < Pr,max Vr G {1,..., i?}, 

(41b) 

R 

relay sum power; E Pr(w, a) < Pr 

,max; (41c) 

r—1 

source power; Ps{'w,a) <Ps ,max: (41d) 

total power; Pt(w, a) < Pt, max- (41e) 


The positivity condition drTal results from the parameter 
transformation in definition (El and ensures that the source 
power is a real and positive number. In inequality (I41bb . 
Pr{w,a) represents the transmit power of the rth relay, 
Pr-.max IS the maximum individual power value for the rth 
relay, Pf(w, a) in inequality (I41cl i is the sum power 

transmitted by the relays in one time slot for which the power 
threshold value Pn^max applies, Pr (w, a) in inequality (I41dl) 
is the total transmitted power at the source in four consecutive 
time slots with threshold value Ps,max, Pt{'^, a) in inequality 
(141 el l is the total transmit power of the network, including 
source and relay powers, during four time slots, and Pt, max 
is the threshold value for Pt(w, a). 

Using equations ([T]) and (|5]l, the power p 7 .(w, a) transmitted 
by the rth relay in the third time slot can be derived as 

P7-(w,a) = E{\tr,3\'^} 

= E{\aiW*^^frSi + alw*,2frS2 + W*2V*^2\'^} 

= (kr.lP + |Wr, 2 p) (|ai/7.p + Cr^) 

= ■wYtir'^l/a + W^D7.W2/a -f wf^E^Wi -f W^ErW2 
= w^Dt-w/u + w^E^w, (42) 

where the {R + 1) x {R + 1) matrices D 7 . and E 7 . have 
respectively \fr\'^ and cr^ as their rth diagonal entry and 
zeros elsewhere, and where = blkdiag([D 7 ., D 7 .]) and 
E 7 , = blkdiag([Er, E 7 .]). In equations (l42li . we have used 
the assumption that the data symbols are independent and 
identically distributed with zero mean and unit variance. Due 
to the symmetry in the transmission scheme, the relay power 
in the fourth time slot is equivalent to the relay power of 
the third time slot, i.e., pr{w,a) = = E{\tr, 4 ,\‘^}. 

Hence pr{w,a) represents the relay power consumed in each 
time slot in which the relays transmit. Note that the inequality 
constraints (I41bl i are convex as pr{'w,a) in equations (l42l) is 
expressed as the sum of the convex quadratic form w^Et-w 
as well as the fraction of the convex quadratic form w^Dt-w 
and the linear term a, which is a convex function ll52l . The 
same holds true for the condition a) < Pr, max in 
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inequality ( 141 cl ), as the summation of convex functions yields 
a convex function. The transmit power of the source during 
the four time slots is given by 

Ps(w,a) = E{\aisi\^} + E{\ais*2\^} + E{\azSi + 0452^} 
+ E{ I — 01431 + 0^3 S2 P} = 2/a + 21013 p + 2|Qf4 p 
= 2 /a + wf^Swi/a + w^Sw2/a 
= 2 /a + w^Sw/a, ( 43 ) 

where S is an (i? + 1) x {R + 1) matrix, having 2 as its 
(i? + l)th diagonal entry and zeros elsewhere and where S = 
blkdiag([S, S]). Note that Ps{'w,a) is a convex function of 
w and a. 

As the sum powers of the relays of the third and fourth time 
slot are equal, the total transmit power of the relays and the 
source during four time slots amounts to 

R 

Pr(w,a) = Ps(w,a) + 2^Pr(w,a). (44) 

r— 1 

With equations (02]) - (04l ). the power constraints (I41al) - 
( 0 Ta can be reformulated as 


a > 0, 

(45a) 

W^DrW/a 4-W^ErW < Pr.max VrG{l,.. 

.,P}, (45b) 

R 

(w^D^w/a -b w^Erw) < Pft.niax, 

F — 1 

(45c) 

2/a -b W^Sw/a < Ps,max, 

(45d) 

R 

2/a-b w'^Sw/a-b2y}^ (w^Drw/a-b w^E^. 

w) < Pt, max, 

r—1 

(45e) 


respectively. Note that V represents the optimization problem 
in its general form. We remark that the optimization proce¬ 
dures developed in the following for problem V remain valid 
if some constraints ( I41bb - ( 141 el ) are removed. 

Introducing the auxiliary variable t, problem V can equiv¬ 
alently be written as 


M : 

min t 

w,o,t 

s.t. f > 0, 

SNRm(w, a) > l/< Vm G {1,..., M}, 
(w, a) G n, 


where 1/t represents the minimum SNR at the destinations. 

The difficulty associated with solving problem Ai lies in 
the SNR constraints which can be formulated as 

t a ~ ’ 

(46) 

where we have used the SNR expression of equation ( 051 ). 
Due to the negative term on the left hand side of the above 
inequality, the SNR constraints are non-convex in general. 


A. Rank-two property and relation of the Rank-2-AFMS to the 
Rank-l-AFMS 

The Rank-l-AFMS can be regarded as a special case of the 
Rank-2-AFMS where symbols are transmitted sequentially by 
a single beamformer, i.e., choosing W 2 = In the Rank- 

l-AFMS, each symbol is communicated in two time slots; in 
the first time slot the source sends the signal to the relays 
and in the second time slot the relays forward their received 
signals to the destinations. Note that the number of time slots 
used to transmit one data symbol is two for the Rank-l-AFMS 
(W 2 = Ofl+i) and the proposed Rank-2-AFMS (w 2 f 0/j+i). 

In the following we analyze the Rank-l-AFMS and the 
Rank-2-AFMS, applying SDR to problem A4. We will prove 
that for both schemes, the SDR versions of A4 are equivalent. 

Let us derive an equivalent representation of the SNR 
at the mth destination given by equation ([39]). Defining 
the matrix A = ,... ,1]), where pr — 

arg(/r), we notice from the definitions dTS] ) and ([36]) that 
Qm,i = A^Qm, 2 A. The unitary transformation W 2 = Aw 2 
exhibits the useful property that W 2 Qm, 2'^2 = '^2 Qm,i'^ 2 - 
Moreover, w|^R„iW 2 = w|^R,„W 2 , which follows from the 
definition of A and from equations ([T^ and ([32]). Then, using 
equation ([39] ), the SNR constraints (06]) can be formulated as 

wf Qm,iwi -f wf Qm,iW 2 > 

{j - (wf R„wi -f wf R™W 2 -b al). 

Due to the quadratic forms wf^Q^.iWi and w|^Qm 1 W 2 , 
the above inequality describes a non-convex set. Non-convex 
problems are difficult to solve and generally NP-hard. To 
reformulate non-convex QCQPs, the SDR technique has been 
proposed in the literature 113, na. In the latter technique, 
the identity wf^Qm.iWi = tr(wiwf^Qm.i) is exploited and 
wiwf^ is substituted in A 1 by a positive semidefinite matrix 
Xi. Substituting further W 2 W^ by X 2 leads to 

min t 

Xi 5X2 5R 

s.t. f > 0, 

tr((Xi + X 2 ) Qm,l) > 

(a _ ^ X 2 )R™) + al) 

Vm G {1,..., M}, 

(Xi+X 2 ,a)GT, 

Xi^0,X2^0, (47) 

where the constraints rank(Xi) = 1 and rank(X 2 ) = 1 are 
neglected and where the set 

{X,a I X,a satisfy 08])} 
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is defined by the power constraints 

a > 0, (48a) 

tr(X(D 7 -/(I + E^.)^ Pr,max Vr e (48b) 

R 

tr(X(Dr-/a + Er-)) < PR,max, (48c) 

r—1 

2/a + tr(XS)/a<Ps,max, (48d) 

R 


2/a + tr(XS)/a + 2 tr(X(Dr/a + E^)) < PT,max- 

r—1 

(48e) 

Here the constraints (148ab - (I48eb correspond to constraints 
(145ab - ( I45eb . respectively. Note that for the Rank-1-AFMS 
W 2 = 0/j+i follows from W 2 = Orj+i and X 2 = O/j+i holds 
true. For the Rank-1-AFMS, the SDR problem corresponding 
to problem (1471) is given by 

min t 

Xi ,t,a 

s.t. f > 0, 

tr(XiQ™,i) > (^^ - ^^(tr(XiR™)+a^) 

Vm € {1,..., M}, 

(Xi,a)eT, 

Xi ^ 0. (49) 

Theorem 1. (Equivalence of SDR problems) The optimum 
value t* of the problem ( 147b corresponding to the Rank-2- 
AFMS is the same as for problem ( 149b corresponding to the 
Rank-1-AFMS. 

Proof: Let (X^,f*,a*) be a solution to problem ( l49b . 
then (X^,X 2 = Ofl+i,f*,a*) is a solution to problem 
as the constraint functions in problem Wt\ only depend on the 
sum of Xi and X 2 . On the other hand, if (X{, X 2 , t*, a*) is a 
solution of problem (|47] |. then (X^ -|-X 2 ,f*,a*) is a solution 
of problem ( |49] | as the sum of positive semidefinite matrices 
results in a positive semidefinite matrix. ■ 

As a consequence of the latter theorem, the SDR of AA for 
the Rank-1-AFMS given by problem ( |49] | and the SDR of j\4 
for the Rank-2-AFMS given by problem (l47l i are equivalent. 
For the Rank-2-AFMS, a feasible solution for j\4 can be 
computed from the eigendecomposition of X* if 7^(Xi *)<2 
as will be shown in the following. 

Let Xi* be a solution of problem ( |49] | with 7Z(Xi*) = 2 
and let X* = AiUiu{^ -f A 2 U 2 u|^ with non-zero eigenvalues 
Ai and A 2 and the respective eigenvectors Ui and U 2 . Then, the 
solution X* of the relaxed problem is a feasible global solution 
of V. The decomposition of X^ into two components is not 
unique and can be obtained from any vector pair wi and W 2 
satisfying X^ = wiwf^ + W 2 w|^. This is different from the 
Rank-1-AFMS approach where a single beamforming vector 
Wi is computed and where the SDR solution is only feasible 
if 7?.(Xi*) = 1. In the proposed Rank-2-AFMS, the number 
of degrees of freedom is increased due to the introduction of 
two linearly independent weight vectors. 

We remark that problem ( |49] | still represents a non-convex 
problem, as the fractions of two linear terms in inequalities 


(I48bl i, ( I48cb . and (I48eb represent non-convex functions. More¬ 
over, the multiplication of the two linear terms in the SNR 
constraints ( |49] | results in a non-convex function. 

However, for constant t and a, the feasibility problem to 
compute a feasible matrix Xi is a convex SDR In llT4l . the 
authors have proposed to perform a ID bisection search to 
find the optimum t for a problem similar to problem (|4^ 
in the context of transmit beamforming. In the latter search 
procedure, the feasibility problem is solved in each iteration. 
Here, in order to solve problem ( l49l l. a 2D search is required 
since there is an additional power scaling factor a involved. 
Note that the optimum power allocation factor a* cannot 
be found by bisection search as it is a-priori not possible 
to determine an appropriate search interval. Therefore, we 
propose to perform grid search on a. For each point on the 
search grid, an additional bisection search according to 01 on 
the optimum t is performed. We refer to this search procedure 
as the SDR2D algorithm. 

One drawback of the SDR2D algorithm is its computational 
burden to solve an SDP in each iteration of the exhaustive 
2D search. Another drawback is that 7?.(X{) > 2 in general. 
In this case, the corresponding optimal value t* is only a 
lower bound for A4 (within the grid search precision) as X| 
is not feasible for problem V since there exists no rank-two 
decomposition X^ = wiw^ -(- W 2 w|^. For cases in which 
the SDR solution is not feasible for the original problem, 
randomization techniques have been applied in ifTTl - lfTSl to 
generate feasible points that are suboptimal, in general. 

Besides the SDR-based algorithms to compute vectors, 
recently, iterative algorithms have been developed which out¬ 
perform the SDR-based randomization algorithms in terms 
of performance and computational complexity 0 - 1101 , na. 
The latter algorithms perform an inner approximation of the 
original non-convex problem and maintain the number of 
variables, whereas in the SDR technique, the number of 
variables is roughly squared. 

In the rest of this section, we develop an iterative algorithm 
which computes the weight vector and adjusts the source 
power to maximize the minimum received SNR at the des¬ 
tinations. 

B. Convex inner approximation technique 

In the max-min fairness beamforming problem 
AA, the left hand side of the constraints (l46l) 

is the difference of (w^R^w Fal)/t and 

(w^(Q^-f (|(imP/cr2)R„)w-f As R„ and 

Qm are positive-semidefinite matrices, these functions are 
both convex since they consist of a convex quadratic form 
divided by a linear term and a constant divided by a linear 
term ll52l . Therefore, the left hand side of inequality (l46l l is a 
difference of two convex (DC) functions and Jvi belongs to 
the class of DC programs ll53l - ll57l . 

To solve Jvi approximately, we propose an iterative algo¬ 
rithm which generates a sequence of weight vectors with 
iteration index fc € Nq. At iteration k-\-l, the subsequent vector 
of is generated according to 

^(k+l) A ^{k) 


(50) 



using the update vector Aw as further specified below. Sim¬ 
ilarly, and are the optimization variables a and t at 
the fcth iteration, respectively. They are updated according to 

al'^+i) = + Aa, A i(fc) + Af, (51) 


where Aa and Af are the update variables as given below. 

Let us assume that and represent fixed 

feasible points of problem and let Aw, Aa, and At denote 
optimization variables. If we replace w in A1 by w^^^ + Aw, a 
by a*-^^ + Aa, and t by t^^^ + At, the power constraints defined 
by the set ft remain convex. To derive a convex approximation 
of the constraints (l46l) let us replace the concave part by its first 
order Taylor approximation around , t^^'^), resulting 

in the convex constraint 


AW(Aw,Aa,At) A 


(w^'^) -p Aw)^R„(w('=) -f Aw) + a] 


(k) 


+ At 

{Qm + {\dm\‘^+\dm\‘^ ( Aa 


7^) 


7(fc) 


-1 


25R{Aw^(Q„ + (|dmp/cr^)R™)w('=)} 




< 0 . 


(52) 


In a, a similar approximation has been used for max- 
min fair beamforming in bi-directional relay networks. The 
approximation made in inequality (|5^ is however tighter than 
that of a in the sense that unlike the approach of a, the 
convex quadratic-over-linear terms are not linearized. In 1^ . 
a linearization approach for max-min fair beamforming in the 
context of cognitive radio networks has been proposed. The 
latter approach assumes a centralized system and therefore 
does not consider the problem of power allocation which arises 
in our distributed beamforming application. Power allocation 
and beamforming optimization have been addressed in ll28l . 
where a CCCP algorithm has been proposed to minimize the 
transmit power in a cooperative relay network. An iterative 
algorithm for power minimization in a Rank-2-AFMS has been 
proposed in our accompanying conference paper ifTol . In the 
referenced work, however, direct source-destination channels 
and power allocation have not been regarded. 

Comparing inequalities (l46l l and (l52T i. we find that 


-b Aw, a^^^ + Aa, -I- At) < A^^ (Aw, Aa, At), 

(53) 

which is a consequence of the linearization of the concave 
part of Am(w,a,f), see Section 3.1.3 in ll5^ . Therefore, the 
convex problem 


min -b At 

Aw, Aa, At 

s.t. A«(A w, Aa, At) < 0 Vto g { 1 ,..., M}, 

t(k) + Af > 0, 

-b Aw, -b Aa) G fl, (54) 


represents an inner approximation of Ai. 

Corollary 2. Let {wd^\ad^\td^'>) be feasible for Ai. The 
updated variables obtained from a 

solution {Aw*, Aa*, At*) to problem ( 1541 ), are feasible for 
Ai and < t'^^y 


Proof: At* < 0 and consequently < t'Al hold 

true as (Aw = 0, Aa = 0, At = 0) is feasible for prob¬ 
lem (|54] | since (w^^'f a(^'\ t(^'^) is feasible. From inequality 
(l5^ follows Am(w('"'+^\ a(^+^\ < 0 and, conse¬ 
quently, the SNR constraints are fulfilled. As > 0 and 

a*^^+^^) G fl are ensured according to problem (l54l i. 
the feasibility of ) for A4 is guaranteed. 

■ 

The property ensures that the minimum 

SNR increases or remains unchanged in each iteration. Re¬ 
peatedly solving problem dSli for k = 0 , 1 , 2 ,... creates 
a monotonically non-decreasing sequence of minimum SNR 
values < \/t^^'> < 1 /A^A.. with feasible weight 

vectors and a feasible transmit power. Interestingly, we can 
show that the latter update iteration does not exhibit diver¬ 
gence or oscillation. Moreover, the sequence {w^^\ ad^\t^^'>) 
converges globally, i.e., for every feasible start point, to a 
stationary point of Ai that satisfies the Karush-Kuhn-Tucker 
(KKT) conditions. 0 For non-convex optimization problems, 
a KKT point can be a local optimum, a global optimum, a 
maximum, or a saddle point ll52]| . It is easy to verify that 
for Ai, a KKT point cannot be a maximum: For any point 
(w, a, t), reducing the source power by choosing a larger 
power scaling factor a leads to a larger target function value 
t. We remark that it is in general NP-hard to prove the local 
optimality of a KKT point ll58ll . Even though a proof of 
optimality of a KKT point is difficult to derive, the search 
for KKT points has been found useful to solve non-convex 
optimization problems 1651 . 1661 . 

Theorem 3. Let us assume that the power factor a is 
bounded by Umax A o- Then, for any feasible initial point 
{wd^\a (o)^i(o))^ ffie sequence (w (fc),a(fc),i(fc)) 

converges to 

a stationary point. 

Proof: According to Theorem 10 of fsH, the se¬ 
quence (w(^\ a*-^\ globally converges if the mapping 
(w(^\ a(^\ f(^^) —>■ (w(^+^\ of the definitions 

(l50l) and (fSTl l is uniformly compact. This is the case if the 
feasible set of Ai is compact l54l . We show that the variables 
lie inside a compact set, i.e., a set that is closed and bounded: 
Using our assumption and the constraint on the source power 
(l45dll . a lies in the interval 2/Ps,max A a A Omax which is 
closed and bounded. Moreover, due to the power constraints 
(l45bll - (l45eli . it is clear that the set of feasible weight vectors 
w is bounded and closed, t lies in the closed and bounded 
interval fopt A i A where fopt is the (unknown) optimum 
value. ■ 

For the latter proof, we have assumed that a < Umax which 
implies that the source power during the first two time slots 
does not vanish. Adding a < Umax as an additional constraint 
to Ai is not a critical requirement provided that a^ax is 
chosen sufficiently large. Then, the approximated problem (l54l) 


*For points where constraint qualification holds, the KKT conditions are 
necessary for local optimality. Note that for convex optimization problems, 
the KKT conditions are sufficient to guarantee global optimality if Slater’s 
constraint qualification is satisfied (m 
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becomes 

min + At 

Aw, Aa. At 

s.t. A«(A w, Aa, At) < 0 Vm G {1,..., M}, 

^ + Act ^ Ctniaxj 

t(fe) + Ai > 0, 

+ Aw, + Aa) G n. (55) 

The globally convergent Max-Min-CCCP algorithm out¬ 
lined in Algorithm 1 starts at a random point and iterates until 
the relative progress p A |t(^) — ^('=+ 1 ) falls below the 
threshold value e. 


Algorithm 1: Max-Min-CCCP 

1 begin 

2 Set k := 0. Create and scale random ^ 02^+2 
and ^ 0 such that {'w^^\a^^'>) G n. Compute 

= l/(minSNRm). Set p > e. 

3 while p > e do 

4 fc := fc -I- 1. 

5 Compute (Aw*, Aa*, AC) by solving problem 

(l55l l. using update 

^(fc+i) _ ^(fc) _|_ = a^^) + Aa* and 

^(fc+i) = f{k) 

6 p := |C - C+i|/C. 

7 end 

8 end 

9 return 


As the proposed algorithm is based on the linearization of 
the concave (negative convex) functions of a DC program, it 
belongs to the class of CCCP algorithms ||5^ . 

Note that in our algorithm, the problem (l55l l is solved 
exactly. To reduce the computational cost, it is possible to 
use an inaccurate solution Il22ll . A detailed description of 
an implementation is beyond the scope of this work. In our 
simulations, the subproblems that arise in every iteration of 
our proposed algorithm are solved exactly. 

IV. Simulation Results 

To evaluate the performance of the proposed scheme under 
realistic conditions, we consider a relay network, where the 
coefficients of the source-relay, relay-destination and source- 
destination channels model an urban micro scenario ll48l . The 
system parameters are chosen according to the Long Term 
Evolution (LTE) standard for mobile communication llMl.The 
system is operated at a carrier frequency of 1800 MHz and 
we choose Tg = 66.7psec as the duration of one time slot. 
The bandwidth is given by l/Tg which corresponds to the 
bandwidth of a subcarrier in a multi-carrier LTE system. We 
further assume frequency flat fading channels. We create the 
channel coefficients such that there is no shadow fading from 
the source to the relays but from the source and relays to 


the destinations. The noise power is set to cr^ = cr^ = —132 
dBm. The maximum transmit power values are chosen as 

L*T,max5 L*5niax — max/2, Pji^n\ax — L^Tniax/8, and 

Pr,max = -Pt, max/15, respectively. 

In the network, i? = 10 relays are placed at a distance 
of 250 meters around the source at equidistant angles. The 
destinations are randomly distributed in between 600 and 800 
meters around the source, see Eig. [3 The source, the relays, 
and the destinations are placed at a height of 10, 5, and 1.5 
meters, respectively. 

We investigate the performance of the following transmis¬ 
sion schemes, scenarios, and algorithms: The Rank-2-AEMS 
combined with the Max-Min-CCCP algorithm (R2-Max-Min- 
CCCP) and the SDR2D algorithm (R2-SDR2D). Eurthermore, 
we consider the Rank-l-AEMS combined with the CCCP 
algorithm (R1-Max-Min-CCCP) and the SDR2D algorithm 
(R1-SDR2D). Moreover, we investigate also direct source- 
destination (DSD) communication, where the relays are not 
involved. All results are compared with the theoretical upper 
bound (SDR2D-UB) obtained by the SDR2D algorithm, see 
Subsection IIII-AI 

Eor the SDP-feasibility problems, which are solved in the 
SDR2D algorithm to find a solution of problem (l49l l. and to 
solve the subproblems (fSSl) of the Max-Min-CCCP algorithm, 
we have utilized the cvx interface for convex programming in 
a Matlab environment ll62ll . As the solver, we have chosen 
Mosek 7.0.0.103 under the default precision ||63l. Eor the 
SDR2D algorithm, we search over 200 grid points to determine 
the power scaling factor a and select e = 10 “^ as the precision 
in the bisection search to determine the optimum t. We chose 
e also as the threshold value for the relative progress of the 
Max-Min-CCCP algorithm. We select Umax = 2 • 10®/Ps,niax 
in the problem (l55l) that is solved in every iteration of the 
Max-Min-CCCP algorithm. 

In the case that the solution matrix to the optimization 
problem ( |49] ) obtained by the SDR2D algorithm is not feasible 
for A4, we apply the Gaussian randomization procedure of 
ifTfill . Eor the Rank-l-AEMS, we select the best out of 200 
random vectors, where each vector is properly scaled to meet 
the power constraints. Eor the Rank-2-AEMS, we create 200 
random vector pairs. Each pair is properly scaled by solving a 
linear program using the function linprog. m of the Matlab 
optimization toolbox lIMll . 

In our first example, we examine the performance of 
all setups in terms of the average minimum achieved rate 
by solving Ai. The minimum achieved rate is given by 
(l/ 2 )log 2 (l + SNR) (the prefactor 1/2 takes into account 
the time slots per communicated symbol) for communications 
with relays and by log 2 (l + SNR) for DSD, as one symbol 
per time slot can be communicated. 

Fig. a depicts the average minimum rate versus Hr.max 
in the case that the direct source-destination channels are 
exploited for M = 100 destinations. In our simulation re¬ 
sults depicted in Eig. |5] it is assumed that no direct source- 
destination channels exist. 

Both figures demonstrate that R2-Max-Min-CCCP achieves 
near-optimum performance close to the theoretical upper 
bound. The Rank-l-AEMS is clearly outperformed by the 
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Fig. 3. Setup of the network; 0: source, o: relay, •: destination. 


M 

10 

40 

70 

100 

130 

Average Tt(K*) 

1.8 

2.5 

3.0 

3.3 

3.4 


TABLE I 

Average rank 7^(XJ) versus number of destinations M. 


Rank-2-AFMS. Comparing Fig. |4] with Fig. |5] we observe 
that the performance improves significantly if the source- 
destination channels are exploited. 

Fig. |6] depicts the average minimum rate versus the number 
of destinations M in the case that the direct source-destination 
channels are exploited for Pt, max =5dBm. In Fig. |7] the 
same setups are considered, assuming that no direct source- 
destination channels exist. 

Both Fig. |6] and Fig. [T] demonstrate that R2-Max-Min- 
CCCP achieves near-optimum performance close to the upper 
bound. For small M, the SDR2D algorithm performs slightly 
better than the Max-Min-CCCP algorithm due to the fact, 
that in most cases, the respective solution matrix has a rank 
smaller or equal to two. As illustrated in Table |I] the rank of 
the SDR solution matrix increases with increasing M. This 
leads to suboptimal solutions generated by the randomization 
technique. 

In our second example, we examine the computational 
aspects of the considered algorithms for M = 100 desti¬ 
nations and Pt, max =20dBm. Table HI] shows the average 
runtime in seconds of the different algorithms. We observe 
that the Max-Min-CCCP algorithm is more than ten times 
faster than the SDR2D algorithm. Fig. [8] depicts the number 
of iterations of the Max-Min-CCCP algorithm for the Rank- 
2-AFMS versus the minimum SNR. In each simulation run, 
we have initialized the R2-Max-Min-CCCP method with ten 
different starting points. In Fig. the highest (R2-Max-Min- 
CCCP FI) and the lowest minimum SNR (R2-Max-Min-CCCP 
L) achieved within these ten optimization runs of the Max- 
Min-CCCP algorithm are depicted for each iteration. As a 
comparison to the CCCP algorithm, the performance of the 
SDR-based approaches is depicted. To analyze the influence 
of the randomization procedure on the performance of the 
SDR2D algorithm, we increase the number of random vectors 


Algorithm 

SDR2D, Rl-Rand 

SDR2D, R2-Rand 

Rl-CCCP 

R2-CCCP 

Runtime 

1.0 ■ 103 

1.1 ■ 10^ 

81 

83 


TABLE II 

Average runtime in seconds . R1 -Rand: Rank-one 
RANDOMIZATION, R2-RAND: RANK-TWO RANDOMIZATION, R1 -CCCP: 
RI-Max-Min-CCCP, R2-CCCP: R2-Max-Min-CCCP. 



Fig. 4. Minimum rate versus total power Pt, max. first example. 


from 200 to 1000 for the R1-SDR2D method (R1-SDR2D 
1000) and for the R2-SDR2D method (R2-SDR2D 1000). As 
expected, the increased number of random vectors leads to a 
higher minimum SNR, however, the R2-Max-Min-CCCP still 
outperforms the SDR-based designs. Furthermore, we observe 
that the main progress of the Max-Min-CCCP algorithm is 
achieved within the first three iterations. In the following 
iterations, the gain that is obtained is less than IdB. Since 
the Max-Min-CCCP algorithm exhibits excellent performance 
within a few iterations it is suitable for real time applications. 
The difference between the highest minimum SNR and the 
lowest minimum SNR achieved with ten different starting 
points decreases with the number of iterations. For more than 
nine iterations, the difference is less than 0.5 dB. We remark 
from this observation that the performance can be improved 
if the R2-Max-Min-CCCP is initialized with different starting 
points. 


V. Conclusion 

In this paper, a novel Rank-2-AFMS for single-group 
multicasting has been proposed. Our scheme generalizes the 
rank-one multicasting scheme of the literature to a rank- 
two multicasting scheme and allows to incorporate the direct 
channel from the source to the destinations in the detection. To 
select the proper source power and to adjust the relay weights 
we propose an iterative algorithm to maximize the lowest SNR 
at the destinations. The simulation results demonstrate that the 
proposed Rank-2-AFMS combined with the proposed iterative 
algorithm yields a performance close to the theoretical upper 
bound. 
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Fig. 5. Minimum rate versus total power Pjp max^ first example. 



Fig. 6. Minimum rate versus number of destinations M, first example. 



Fig. 7. Minimum rate versus number of destinations M, first example. 
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Fig. 8. Minimum SNR versus number of iterations, second example. 
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